Similar resources
On time-frequency masking in voiced speech
This paper addresses the issue of masking of noise in voiced speech. First, we examine the audibility of cyclostationary narrow-band noise bursts added to voiced speech generated by synthetic excitation. Varying the temporal location of noise within a pitch cycle corresponds to varying its phase spectrum. Using this fact, we found that a change of phase of the noise in the high frequency region...
Robust speech separation using time-frequency masking
A multi-microphone time-frequency speech masking technique is proposed. This technique utilizes both the time-frequency magnitude and phase information in order to estimate the Signal-to-Noise Ratio (SNR) maximizing masking coefficients for each time-frequency block given that the direction (or alternatively, the time-delay of arrival) of the speaker of interest is known. Using this masking algo...
A consideration on time-frequency masking methods for speech separation
Time-frequency masking methods, primarily known as DUET [2] and SAFIA [3], are effective schemes for the blind speech separation problem. Based on an investigation of the conventional delay-histogram and the time-frequency masking method in terms of estimated delay accuracy, two novel approaches for the clustering process are proposed. In particular, the proposed methods tend to improve relatively large amoun...
Time-frequency masking for large scale robust speech recognition
Time-frequency mask estimation has shown considerable success recently. In this paper, we demonstrate its utility as a feature enhancement frontend for large vocabulary conversational speech recognition. Additionally, we investigate how masking compares with feature denoising, which directly reconstructs clean features from noisy ones. We train a mask estimator that predicts ideal ratio masks. ...
Perceptual speech coding using time and frequency masking constraints
This paper presents a new wide-band speech coding system based on a fast wavelet packet transform algorithm as well as a formulation of temporal and spectral psychoacoustic models of masking. The proposed FFT-like overlapped block orthogonal transform allows us to approximate the auditory critical band decomposition in an efficient manner, which is a major advantage over previous approaches that ...
Journal
Journal title: IEEE Transactions on Speech and Audio Processing
Year: 2000
ISSN: 1063-6676
DOI: 10.1109/89.848218